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Abstract 

A quantum compiler is a software program for decomposing ("compiling") an arbi- 
trary unitary matrix into a sequence of elementary operations (SEO). Coppersmith 
showed that the iV^-bit Discrete Fourier Transform matrix Uft can be decomposed 
in a very efficient way, as a sequence of order(iYs 2 ) elementary operations. Can a 
quantum compiler that doesn't know a priori about Coppersmith's decomposition 
nevertheless decompose Uft as a sequence of order(iYs 2 ) elementary operations? In 
other words, can it rediscover Coppersmith's decomposition by following a much more 
general algorithm? Yes it can, if that more general algorithm is the recursive appli- 
cation of the Cosine-Sine Decomposition (CSD). 
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1 Introduction 



In quantum computing, elementary operations are operations that act on only a few 
(usually one or two) qubits. For example, CNOTs and one-qubit rotations are ele- 
mentary operations. A quantum compiling algorithm is an algorithm for decomposing 
("compiling") an arbitrary unitary matrix into a sequence of elementary operations 
(SEO). A quantum compiler is a software program that implements a quantum com- 
piling algorithm. 

Henceforth, we will refer to Ref. PQ as Tuc99. Tuc99 gives a quantum compiling 
algorithm, implemented in a software program called Qubiter. The Tuc99 algorithm 
uses a matrix decomposition called the Cosine-Sine Decomposition (CSD) which is 
well known in the field of Computational Linear Algebra ;2J. Tuc99 uses CSD in a 
recursive manner. Henceforth we will refer to the recursive application of CSD as 
re-CSD or reap-CSD. 

A modest desideratum for a quantum compiler is that it should recognize when 
a matrix is a tensor product of one-bit operators, and decompose such a matrix into 
a tensor product of one-bit operators. Qubiter does this for the iVe-bit Hadamard 
matrix Hn b . 

In Ref. [3], Coppersmith showed how to express the iV^-bit Discrete Fourier 
Transform matrix Uft(n b ) i n a very efficient way, as a sequence of order(iV£ 2 ) ele- 
mentary operations. His decomposition will henceforth be called the quantum Fast 
Fourier Transform (qFFT). Another more difficult desideratum for a quantum com- 
piler is that it should decompose Uft{n b ) into a sequence of order(iV B 2 ) elementary 
operations. Qubiter does this too. 

Numerical evidence that Qubiter can compile H^ B and Uft(n b ) for Nb = 
1, 2, 3, 4 in this ideal way was reported in Tuc99. The goal of this paper is to explain 
analytically why Qubiter behaves in this ideal way. Qubiter does not behave this 
way because it is hardwired to recognize H^ B and Uft(n b )- Such a highly specialized 
approach would be of limited scope. Instead, the reason it behaves this way is because 
efficient expansions of both H^ B and Uft(n b ) can both be viewed as special cases of 
re-CSD, and re-CSD is Qubiter's specialty. This is a promising result. It hints that 
re-CSD is a door to compiling efficiently a large class of unitary matrices that includes: 
Hn b , Uft{n b ), and an infinitude of other matrices. 

2 Notation 

In this section, we will introduce some notation that is used throughout this paper. 
For additional information about notation, the reader is referred to Ref. [3]. Ref. [3] is 
a review paper by the author of this paper that uses the same notational conventions 
as this paper. 

For integers a, b such that a < b, let Z a b = {a, a + 1, . . . , b — 1, b}. We will 
often use Nb to denote the number of bits in a quantum register, and N$ = 2 Nb to 
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denote the corresponding number of states. 

First, let us introduce the members of our cast of characters that are 2d ma- 
trices. The Pauli matrices are 

(1) 

This drama will also feature the 2d identity matrix and the one-bit Hadamard matrix: 
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Note that H is related to e % 1 a v, a n/2 rotation about the Y axis, as follows: 



e l 4 a y = cost — ) - i(j v sm( — ) = —= 

4 y 4 y/2 



1 -1 
1 1 



Ha z . 



(2) 



(3) 



Of course, matrices of dimension greater than 2 will also make an appearance 
in this play. Some will be built by using the tensor product and direct sum of matrices. 
In particular, for any matrix A, we can tensor-multiply or direct-sum several copies 
of A. For any positive integer r, let 



A® r = A ® A <g> . . . ® A , 



r copies of A 



and 



A® A< 



A 



(4) 



(5) 



r copies of A 

For example, I® r <S> A = A® 2r . A fact that will be useful later on is that for any two 
matrices A, B, the transpose operation distributes over © and <g>: (A®B) T = A T @B T 
and (A ® B) T = A T ® B T . 

For any positive integer r, let D r : 



D r — (Ha z ) <S> J®*" -1 = — 



_ J®r— 1 
J®) — 1 



(6) 



where J®° = 1. Note that D r is 2 r dimensional. 

The main theme of this drama is the recursive application of the CSD (re-CSD 
or reap-CSD). The CSD (in the form used here) is defined as follows. Given a unitary 
matrix U of even dimension N, U can be expressed as 



U= (Lq © Li)D(i2o 8 Ri) , 



(7) 



with 



D 



C S 

-s c 



(8) 
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Figure 1: Diagrammatic representation of Cosine-Sine Decomposition. 




Figure 2: Definition of black-filled nodes. 

and 

C 2 + S 2 = 1 , (9) 

where Lq, Li, Rq, R\ are unitary matrices of dimension N/2, and where C and S are 
real diagonal matrices. Eq.(J7J) is represented diagrammatically in Fig^ Matrix D is 
assigned to the node, matrix U is assigned to the incoming arrow, matrices L ©Li and 
Rq © Ri are each assigned to an outgoing arrow. When the CSD is used recursively, 
then the various applications of CSD can each be represent as in FigQand connected 
to form a CSD binary tree. We will refer to the matrix assigned to the arrow entering 
the root node of the CSD tree as the initial matrix, Ui n , of the tree. FigO shows 
another convention for CSD trees that will be used here. Namely, a black-filled node 
will represent a node that is assigned the same matrix that is assigned to the node's 
single incoming arrow. 

In this paper, we are mostly concerned with "degenerate" CSD trees that 
have been pruned so that they contain only the leftmost or rightmost branches (See 
Fig|HJ). As discussed in most books about programming algorithms, there are several 
algorithms for traversing all the nodes of a tree. The algorithm that will be used 
in this paper (also used by Qubiter) is one of the most common, and is called the 
in-order tree transversal strategy. In this strategy, one visits (1) the left sub- 
tree, (2)the root node, (3) the right sub-tree, in a recursive manner. If one lists, in 
accordance with the in-order strategy, the node labels of either of the two trees in 
FigE3 one obtains for both trees: 

U in = X 1 X 2 X 3 ...X 9 . (10) 
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Figure 3: CSD trees pruned so that they contain only leftmost or rightmost branches. 

Henceforth we will refer to the left and right hand side trees of FigEI as the uphill 
and downhill trees, respectively. 

3 Hadamard Matrices 

In this section, we will consider re-CSD with initial matrix equal to the A^-bit 
Hadamard matrix H® Nb . This problem is closely related to the one considered in 
the next section, re-CSD with initial matrix equal to the A^-bit Discrete Fourier 
Transform matrix. For simplicity, we will assume that Nb = 4. How to generalize 
our results from Nb = 4 to arbitrary Nb will be obvious. 

FigE] shows the CSD tree that is produced by Qubiter when the initial matrix 
is if® 4 . We will spend the remainder of this section explaining Fig0] (As discussed in 
Tuc99, the CSD is not unique. Due to this non-uniqueness, there are many possible 
CSD trees that can be produced from compiling the same initial matrix if® 4 . Fig0] 
is just one of these possibilities. Tuc99 discusses what choices must be made in order 
to steer Qubiter towards producing this particular tree.) 

We can express the initial matrix if® 4 as a product of one-qubit Hadamard 
matrices: 

H ® 4 = H (3)if (2)ff (l)ff (0) . (11) 
Eq. ffTTj) can be expressed recursively as 

ff® 4 = T(3210) , (12a) 
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Figure 4: re-CSD with initial matrix equal to the 4-bit Hadamard matrix. 



r(3210) = (#a 2 )(3K(3)r(210) 

r(2io) = (#<7 z )(2)<7,(2)r(io) , 

T(10) = (Ha g )(l)a z {l)T(0) , 



(12b) 
(12c) 
(12d) 



and 



T(0) = (Ha z )(0)a z (0) . (12e) 

It is convenient to translate the various bit-labelled operators in Eqs. (|12|) into 
matrices. Define T r for r e Zq^ by 



r(32io) = r 4 , 
r(2io) = /®r 3 , 
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(13a) 
(13b) 



r(io) = /® 2 ®r 2 , 



(13c) 



and 



r(o) = i m ® ri 



1 . 



(13d) 



(13e) 



Note that for r e Zq^ , T r is a matrix of dimension 2 r . In fact, T r = H® r . The 
bit-labelled operators (Ha z )(a) can be expressed in terms of the D r matrices defined 
by Eq.©: 



(Ha z )(3) = D 4 , 



(14a) 



(Ha z )(2) =/(g) D 3 



(77^(1) = /® 2 ®D 2 , 



and 



(#<7*)(0) = !® 3 ®£>i 
Likewise, the bit-labelled operators a z (a) can be expressed as matrices: 



(14b) 
(14c) 

(14d) 

(15a) 
(15b) 
(15c) 

a 2 (0)=7® 3 ®a,. (15d) 

After replacing bit-labelled operators by their matrix equivalents via Eqs. (jl3|) . (fPfj) 
and (fT5|). the recursion relation defined by Eqs. (fT2j) becomes simply: 



^(3) = 
<x z (2) = 7c 



j®3 Q 

-7® 3 







-I® 2 

I 
-7 



and 



T r 

o -r r 



(16) 



for r G Z 0i 



3- 
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The downhill tree of FigE] was obtained using re-CSD in combination with 
Eq.(fTB|) and the following identities: 



a z <8> H® 3 = 

H® 3 <g> {-H® 3 ) , (17a) 
a® 2 ® H® 2 = 

H® 2 ® (-#® 2 ) ® (-# 02 ) ® H® 2 , (17b) 
H ® ® ® # ® (-if) ®H®H® (-H) , (17c) 



and 



o" 2 = 

<£> (— C*) ® (—&z) ® cr Z ® (— Cz) ® (g) cr z ® (— (7 Z ) . (17d) 

Listing the node matrices of the downhill tree given by Fig0] (listing them in 
the order visited by an in-order tree transversal) gives: 

H® 4 = D 4 DfD® A Df 8 a® 4 . (18) 

Using re-CSD, Qubiter expresses H® 4 in the form given by the right hand side of 
Eq.([18p. Then it recognizes that: (l)the matrices of the form Df s are one-qubit 
Y-axis rotations, and (2) the matrix a® 4 is a a z matrix applied separately to each 
qubit. 

Note that Eq. ljllj) listed the mutually commuting operators {H(a) : a G Z 3 } 
in one of 4! equivalent orders. If we take the transpose of both sides of Eq. (Jll)) . we 
reverse the order of the H(a) operators: 

H® 4 = H(0)H(1)H(2)H(3) . (19) 

In fact, we can take the transpose of all equations between and including Eqs. (jll|) to 
Eqs.()18|). In particular, we get 



1 r+1 



o 

o -rT 



for r G Z 0j s- Also, 



H m = a ®\ D T) m {DT) m {Dl) m DT . (21) 

Eq. (J21|) also follows if we list the node matrices of the uphill tree given by Fig0] 
(listing them in the order visited by an in-order tree transversal). 
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4 Discrete Fourier Transform Matrices 



■ 2wxy 



In this section, we will consider re-CSD with initial matrix equal to the iV^-bit 
Discrete Fourier Transform matrix, defined by {Uft)x,v '■ 
Zq,n s -i- F° r simplicity, we will assume that N B = 4. 



e N s , where x, y G 




(A 2 r 2 eB 2 r 2 ) 



(A iri eB iri ) T 7 \(A 2 ©B 2 f 2 




(A^QB^) 



Figure 5: re-CSD with initial matrix equal to the 4-bit Discrete Fourier Transform 
matrix. 



FigEJ shows the CSD tree that is produced by Qubiter when the initial matrix 
is Uft for Nb = 4. We will spend the remainder of this section explaining Fig0 

In Ref. [3] , Coppersmith showed how to express Uft as a sequence of order(iV s 2 ) 
elementary operations. We will call his decomposition the quantum Fast Fourier 
Transform (qFFT). For a pedagogical discussion of the qFFT circuit and related 
matters, see Ref.^j. Ref.jl] has the virtue that it uses the same notation as this 
paper. 
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Define the root of unity on and the 2d matrix by 



to = exp(z 



2tt 

iVV 



Q = diag(l, to) 



(22) 



where n is the number operator. 

As in Ref.[4j, for any 2 distinct bits a, (3 G Z 0i n b -i, let us define an operator 
V(a,P)by 



V(a, (3) = exp[in 



n(a)n(/3). 



(23) 



As in Ref.jl], R will denote the bit reversal matrix. For Nb = 4, it maps bits 
0— >3, 1— >2,2— >1, and 3 — > 0. It can be expressed in terms of exchange operators, 
which in turn can be expressed in terms of CNOTs. 

As shown in Ref.jU, the qFFT for N B = 4 is 



U FT = H(3)V(3, 2)V{3, 1)V{3, 0)H(2)V{2, 1)V{2, 0)H(1)V(1, 0)H(0)R . (24) 
A diagrammatical way of saying the same thing is: 



U. 



FT 



H 



H 



H 



H 



R 



(25) 



Note that Eq.(J2H) for Uft and Eq. (fTT|) for H m are very similar. They differ in that 
the expression for Uft contains, in addition to the one-bit Hadamard matrices, the 
bit reversal matrix R, and diagonal matrices inserted between the one-bit Hadamard 
matrices. It is convenient at this point to lump together the diagonal operators that 
occur between the one-bit Hadamard matrices. Define diagonal operators A(-) by 



A(3210) = a z (3)V(3, 2)V(3, 1)V(3, 0) 



(26a) 



A(210) = a z (2)V{2, 1)1/(2,0), 



A(10)=<t z (1)F(1,0) 



(26b) 
(26c) 



and 



A(0) = <j z (0) . 



(26d) 
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Next, let us consider re-CSD with the initial matrix UftR- Eq.([24p can be 
expressed recursively as 



U FT R = T(3210) , (27a) 

T(3210) = (Ha z ) (3) A(3210)r(210) , (27b) 

T(210) = (#<r*)(2)A(210)r(10) , (27c) 

r(io) = (#^)(i)A(io)r(o) , (27d) 

T(0) = (#<t,)(0)A(0) . (27e) 

It is convenient to translate the various bit-labelled operators in Eqs. (|27)l into 
matrices. Define T r for r £ Zo,4 by 

T(3210) = T 4 , (28a) 



and 



r(2io) = /®r 3 , 
r(io) = / 02 ®r 2 



(28b) 
(28c) 



r(o) = i® 3 ® r x , (28d) 

and 

r = 1 . (28e) 

Note that for r e -^0,4? T r is a matrix of dimension 2 r . The bit-labelled operators 
(Ha z )(a) can be expressed in terms of the D r matrices defined by Eq.©: 

(Ha z )(3) = D 4 , (29a) 



{Ha z ){2) = I®D 3 , 



and 



(29b) 
(29c) 



(29d) 
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The bit-labelled operators A(-) can be expressed in terms of the Q matrix defined by 



A(3210) 



a(2) 



■>(!) 
4 



lM]n(3) 



a z (3)e l - ^ 
cr 2 (3)u/^ [4n(2)+2n(1)+n(0)ln(3) 

-fi 4 © tt 2 © ft 

A 3 © 5 3 , 



(30a) 



A(210) 



rn(l) , n(0). 



a 2 (2)e mF ^ + ^ ln(2) 
o- 2 (2)cu i7r[4n(1)+2n(0)]n(2) 

-ft 4 © ft 2 
/ © (A 2 © 5 2 ) , 



(30b) 



A(10) 



n(l) 



(7 z (l)^ [4n(0)]n(1) 









-ft 4 



I m © (Ai © B 1 



(30c) 



and 



A(0) 



J®3 

/® 3 © (A 



Bn 



(30d) 



The matrices A r and I? r , where r G Zo,3> are first mentioned in Eqs. (j30|) . They 
are implicitly defined by those equations. Note that for r e -Zo^, A r and _B r are 
diagonal unitary matrices of dimension 2 r . After replacing bit-labelled operators by 
their matrix equivalents via Eqs.(j2HJ), and (03), the recursion relation defined by 
Eqs.fl2Tj) becomes simply: 



D r 



+i 



A r T T 








(31) 



for r G Z 0: , 
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The downhill tree of FigEl was obtained using re-CSD in combination with 
Eq.flHU). 

Listing the node matrices of the downhill tree given by Fig|5] (listing them in 
the order visited by an in-order tree transversal) gives: 



U FT R = T 4 = D 4 (A 3 ® B 3 )Df 2 (A 2 © B^D^Ax © B^a 



©4^ ©8 



(32) 



Qubiter first finds the necessary permutation R using a strategy to be discussed in 
the next section. Then Qubiter uses re-CSD to express UftR m the form given by 
the right hand side of Eq. (J32|) . Then it recognizes that: (1) the matrices of the form 
Df s are one-qubit Y-axis rotations , (2) the diagonal matrices <r z m and (A r © B r )® s 
are expressible as products of doubly controlled phase factors (such as V ). 

If we take the transpose of both sides of Eq.(|24|). we reverse the order of all 
the operators on the right hand side: 



Uft = RH{0)V{1, 0)#(1)V(2, 0)V(2, 1)#(2)V(3, 0)V{3, 1)V(3, 2)H(3) . 
Diagrammatically, 



U. 



FT 



R 



H 



H 



H 



H 



In fact, we can take the transpose of all equations between and including Eqs. 
Eqs. (l32|) . In particular, we get 



1 r+l 



r;Mr o 

o T T r B r 



D 



r+l 



for r e Z 03 . Also, 



(33) 



(34) 



to 



(35) 



RU FT = T'i = a z m (A t © B x )®\Dl)®\A 2 © B 2 )^(D'^(A 3 © B 3 )D 



2/ n T\©2/ 



(36) 



Eq . (J36|) also follows if we list the node matrices of the uphill tree given by FigJS] 
(listing them in the order visited by an in-order tree transversal). 

5 Bit Permutations Before Each CSD 

In decomposing Uft via re-CSD, we first pre or post multiplied Uft by the bit reversal 
matrix R. This example makes it clear that in using re-CSD, before each application 
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of the CSD, it is helpful to permute either the rows or the columns (or both) of the 
input matrix U in FigQ Only a certain type of permutation will work; it must also be 
a bit permutation, so that it can be expressed as a product of bit exchange operators, 
which in turn can be expressed as a product of CNOTs. But what is a good strategy 
for choosing such a permutation? 

Decompositions such as the ones given here for H® A and Uft come from CSD 
trees that contain only a single branch. To promote the growth of such degenerate 
trees, we want to stunt the growth of either the left matrices Lq, Li or the right 
matrices Ro, R\ in FigJTJ The left matrices are "stunted" if L and L\ are diagonal 
matrices. Likewise, the right matrices are "stunted" if Ro and R\ are diagonal. 

The general CSD is 



u Q0 


Uoi ' 




' L 







c 


s ' 




Ro 





u 10 


U u 











-s 


c 







Ri 



_ LqCRq LoSRi 

When the left matrices L , L x are diagonal, UqqU\ and UqiU^ are diagonal matrices. 
Analogously, when the right matrices Rq,Ri are diagonal, C/oo^qi an d ^10^11 are 
diagonal matrices. So probably a good strategy for selecting a row and/or column 
permutation to perform before each CSD is to find a bit permutation that minimizes 
the absolute value of the off-diagonal elements of Uo Ul Q © t/oi^n an d/or L^o^oi ® 
UioUli. The idea is to coax the growth of sparse trees that have only a few branches 
(rightmost and leftmost branches might or might not be included in such a tree.) 
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